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DETAILED ACTION 
Response to Arguments 

1 . Applicants arguments witli respect to claims 1 , 6, and 1 1 have been considered 
but are moot in view of the new grounds of rejection. In light of the Remarks filed 
01/20/2009 and in light of the specification, Examiner has incorporated Okada US 
5889481 A (hereinafter Okada) to address "linguistic sorting is determined based on a 
compression type of the given compression, a first of the two or more symbols in the 
given compression and a predefined order of symbols", wherein Okada appears to 
teach this limitation for multiple languages. Further, though Lisle in view of Katayama 
teach the ordering of multiple dictionaries, Okada has been introduced to address the 
compression of multiple languages through the use of multiple compression units. 

NOTE: Examiner would like to remind Applicant of the following: 

"USPTO personnel are to give claims Weir broadest reasonable interpretation in 
light of the supporting disclosure. In re Morris, 127 F.3d 1048, 1054-55, 
44 USPQ2d 1023,1027-28 (Fed. Cir. 1997). Limitations appearing in the 
specification but not recited in the claim should not be read into the claim. E-Pass 
Techs., Inc. v. 3Com Corp., 343 F.3d1364, 1369, 67 USPQ2d 1947, 1950 (Fed 
Cir 2003) (claims must be interpreted "in view of the specification" without 
importing limitations from the specification into the claims unnecessarily). In re 
Prater, 415F.2d 1393, 1404-05, 162 USPQ 541, 550-551 (CCPA 1969). See 
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also In re Zletz, 893 F.2d 319, 321-22, 13 USPQ2d 1320, 1322 (Fed. Cir. 1989) 
("During patent examination the pending claims must be interpreted as broadly 
as their terms reasonably allow.... The reason is simply that during patent 
prosecution when claims can be amended, ambiguities should be recognized, 
scope and breadth of language explored, and clarification imposed.... An 
essential purpose of patent examination is to fashion claims that are precise, 
clear, correct, and unambiguous. Only in this way can uncertainties of claim 
scope be removed, as much as possible, during the administrative process."). 
Where an explicit definition is provided by the applicant for a term, that definition 
will control interpretation of the term as it is used in the claim. Toro Co. v. White 
Consolidated Industries Inc., 199 F.3d 1295, 1301, 53 USPQ2d 1065, 1069 (Fed. 
Cir. 1999) (meaning of words used in a claim is not construed in a "lexicographic 
vacuum, but in the context of the specification and drawings."). Any special 
meaning assigned to a term "must be sufficiently clear in the specification that 
any departure from common usage would be so understood by a person of 
experience in the field of the invention." Multiform Desiccants Inc. v. Medzam 
Ltd, 133 F.3d 1473, 1477, 45 USPQ2d 1429, 1432 (Fed. Cir. 1998). See also 
MPEP §2111.01." 

Examiner maintains tine use of Lisle, wlierein Lisle in view of Katayama and 
Okada read upon the claim language and is directly in the scope of the claims (i.e. 
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compression, sorting, and searching or Unicode text data). A compression table is 
construed to be functionally equivalent and equally effective to that of a dictionary or list, 
where both contain information relative to the sorting of a language in memory. A table, 
dictionary, corpus, look up table, etc. is merely a database that is referenced, wherein 
the table/dictionary is specific to sorting. Lisle clearly demonstrates the well known use 
of a binary search and ranking of dictionaries/tables, wherein Lisle teaches that multiple 
dictionaries may be used and in the general case, each dictionary may contain up to j 
segments. A given segment for a given dictionary D will thus be defined by D(i,j) and 
can be invoked by a particular bit pattern resident within the control table. The sum of 
the number of segments employed for all dictionaries used in a given compression 
example must be reserved in individual bit patterns within the control table to identify the 
segments as they occur in the compressed text string. That is, there will be one bit 
pattern in the control table for each segment of each identified dictionary that is 
employed in the compressed text string. Thus, the number of word or phrases 
represented by a single byte in the compressed data set will be the total number 
possible in a single byte, 256 less the total number of all segments that must have 
identifiers reserved for them in the control table and less the number of control bit 
patterns set aside within the control table for special characters, letters, numbers and 
control sequences or signals. By an arbitrary convention employed herein, the number 
of bit patterns within the control table used to identify dictionary segments is assigned 
sequentially starting at the top of the control table, i.e., with a bit pattern of all 1's and 
proceeding to lower numbered bit patterns until the total number of dictionary segments 
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have all been assigned. Any remaining control table spaces may then be assigned for 
the control bit patterns, special characters, letters, numbers and the like and any 
remaining spaces can be given over to the highest figure of merit words (Col. 13 lines 
24-34). 

Further, Lisle teaches the relative weighted figure of merit for each word in the 
document is computed by multiplying the count of the number of occurrences of each 
word by the length in characters of the word. The resulting figures of merit are sorted 
by magnitude with the highest magnitude (highest figure of merit words) being assigned 
to bit positions that remain within the single byte table (after assignment of alphas, 
numerics, special characters and the control characters has been completed). The user 
will have to define how many bit patterns will be available in a single byte table by a 
judicious selection of the control characters, etc. This will leave a certain number out of 
the 256 possible patterns available. A unique header is then written in box 28 to begin 
the compressed data stream. This is followed as shown in box 29 by the segment 
memory location map, a 2 byte pattern indicating the length of all the words that are 
represented in the single byte table from box 26, and followed by the actual list of words 
to be used in the dictionary as shown in box 31 (Col. 19 lines 20-34). 

Furthermore, like the present invention. Lisle clearly teaches that a processor 4 
will construct indices to minimize the dictionary search time. These may have been 
previously constructed and read in at the time the dictionaries themselves are loaded 
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from bulk storage 8. The index set consists of a set of members, each member of 
which will contain the segment number which is reserved in the control table to be 
defined for that segment. The index member has two entries for the length of a word 
and the word itself for the lowest or first entry of that dictionary segment in the collation 
order of words being used. The index member also has two entries for the last or 
highest collation order word entry in that segment for the word's length and for the word 
itself. In other words, the dictionary segment index contains, for each segment to be 
defined, a length and an actual word entry for the lowest order collation entry appearing 
within that segment alphabetically and a length and actual word entry for the highest 
collation order entry appearing within that segment. The collation order may usually be 
alphabetic. An example collation order is shown in FIG. 2. The collation order assumed 
for the example of FIG. 2 is that used in the IBM System/370 computer architecture. 
This is an assigned hierarchical sorting collation order with special characters first in a 
defined order that is known to users of such systems, followed by the alphabet upper 
and lower case and last, by the numerals in the highest collation order of sequence. 
The collation order may be viewed as equivalent to an overall "alphabetic order" for the 
possible entries to be sorted. The actual dictionary entries for each dictionary are thus 
collated first and sorted into the collation order. Each dictionary segment thus begins 
with some low collation order entry of a given length and a given entry word (or number 
or character as the case may be) and the segment index ends with the highest collation 
order entry that appears within that segment of the dictionary being used. The 
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dictionary segment index is used to speed dictionary search time using binary search 
techniques as will be described (Col. 15 lines 23-63 & Fig. 2). 

Additionally, as cited in the previous office action: 

"Katayama et al. US 6260051 B1 US 5550541 A (hereinafter Katayama) 
has been introduced to address the amendment(s) to the claims. Katayama 
teaches a registration two-character chain table producing unit 194 for producing 
a first table block , in which a plurality of registration first and second two- 
character chains respectively including the same type of for general character 
and the position numbers of the registration first and second two-character 
chains are listed in the order of arranging the chains in the converted registration 
character string, for each fore general character type, producing a second table 
block, in which a plurality of registration special two-character chains respectively 
including the same type of fore symbolic character and the position numbers of 
the registration special two-character chains are listed in the order of arranging 
the chains in the converted registration character string, for each fore symbolic 
character type, and combining each first table block corresponding to one type of 
fore general character and one second table block corresponding to one type of 
fore symbolic character determined in correspondence to the type of the fore 
general character to form a two-character chain table for each character group, 
the fore characters of the chains in each two-character chain table belonging to 
the same character group (Katayama Col. 130 lines 33-53). 



Application/Control Number: 1 0/691 ,424 Page 8 

Art Unit: 2626 

Further, Katayama teaches a character chain collating and judging unit 
200 for receiving the position numbers of one particular two-character chain Tc1 
from the storing unit 195 just after the reception of the position numbers of 
another particular two-character chain Tc2 under the control of the control unit 
1 99. (First collation case), collating each position number of a particular second 
two-character chain Tel with a particular position number of a particular first two- 
character chain Tc2 to judge whether or not each position number of the 
particular second two-character chain Tel agrees with the particular position 
number of the particular first two-character chain Tc2 (second collation case), 
collating each position number of a particular special two-character chain Tc1 
with a particular position number of a particular first two-character chain Tc2 to 
judge whether or not each position number of the particular special two-character 
chain Tel is higher than the particular position number of the particular first two- 
character chain Tc2 by one (third collation case), collatinc each position number 
of a particular special two-character chain Tel with a particular position number 
of a particular second two-character chain Tc2 to judge whether or not each 
position number of the particular special two-character chain Tel is higher than 
the particular position number of the particular second two-character chain Tc2 
by two (fourth collation case), collating each position number of a particular first 
two-character chain Tel with a particular position number of a particular special 
two-character chain Tc2 to judge whether or not each position number of the 
particular first two-character chain Tel is higher than the particular position 
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number of the particular special two-character chain Tc2 by one (fifth collation 
case), and detecting a particular position number of a particular two-character 
chain of the particular two-character chain table Tc1 for each collation case 
(Katayama Col. 131 lines 27-67). 

Additionally, Examiner takes Official Notice that it is well known to compress 
input characters/symbols into a different representation such as the dictionary methods 
(LZX, LZ78, LZFG, LZRW1, LZRW4, LZW, LZMW, LZAP, LZY,LZ, and other variants of 
dictionary compression methods taught by "David Solomon, Data Compression - Tlie 
Complete Reference, 4^ Ed., Pages 172-224 & Tables 3.25 and 3.26" demonstrate 
input and output variants,- wherein single symbols/characters are sorted and 
represented as a combination of symbols/characters." 

Claim Rejections - 35 (JSC § 103 

2. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the phor art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claim 1, 2, 5-7, 10-12, and 15-17 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Lisle et al US 4,843,389 (hereinafter Lisle) in view of Katayama et al. 
US 6260051 B1 US 5550541 A (hereinafter Katayama) and further in view of Okada US 
5889481 A (hereinafter Okada). 
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Re claims 1 and 6, Lisle teaches a computer-readable medium having computer- 
executable instructions for performing steps for building a symbol table for storing sort 
weights for a plurality of linguistic symbols used in a plurality of languages supported by 
a computer system (Col. 15 lines 45-63), comprising: 

constructing the symbol table (Col. 19 lines 36-59) to contain a list of code points 
(Col. 20 lines 35-56) each uniquely identifying one of the symbols, and a sort weight for 
the symbol identified by said each code point (Col. 15 lines 45-63); 

providing a plurality of compression tables, each compression table pertaining to 
one of the supported languages and having a compression type and containing 
compressions of symbols of that compression type 

for each code point in the symbol table (Col. 20 lines 35-56), sorting the 
compression tables (Col. 19 lines 36-59) to identify a highest compression type our 
compressions beginning with the symbol (Col. 15 lines 45-63) identified by said each 
code point (Col. 20 lines 35-56); 

storing in the symbol table a tag for each code point to indicate said highest 
compression type for said each code point (Col. 20 lines 35-56). 

NOTE: Tagging a code point is construed to be both functionally equivalent and 
equally effective as ranking or ordering a code point or address in memory for the 
purposes of a hierarchical classification. 

However, Lisle fails to teach each compression being a grouping of two or more 
symbols treated as a single unit for purposes of linguistic sorting and the compression 
type identifying a number of symbols in a given compression 
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Katayama teaches a registration two-character chain table producing unit 194 for 
producing a first table block, in which a plurality of registration first and second two- 
character chains respectively including the same type of for general character and the 
position numbers of the registration first and second two-character chains are listed in 
the order of arranging the chains in the converted registration character string, for each 
fore general character type, producing a second table block, in which a plurality of 
registration special two-character chains respectively including the same type of fore 
symbolic character and the position numbers of the registration special two-character 
chains are listed in the order of arranging the chains in the converted registration 
character string, for each fore symbolic character type, and combining each first table 
block corresponding to one type of fore general character and one second table block 
corresponding to one type of fore symbolic character determined in correspondence to 
the type of the fore general character to form a two-character chain table for each 
character group, the fore characters of the chains in each two-character chain table 
belonging to the same character group (Katayama Col. 130 lines 33-53). 

Further, Katayama teaches a character chain collating and judging unit 200 for 
receiving the position numbers of one particular two-character chain Tc1 from the 
storing unit 1 95 just after the reception of the position numbers of another particular 
two-character chain Tc2 under the control of the control unit 199. (First collation case), 
collating each position number of a particular second two-character chain Tel with a 
particular position number of a particular first two-character chain Tc2 to judge whether 
or not each position number of the particular second two-character chain Tel agrees 
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with the particular position number of the particular first two-character chain Tc2 
(second collation case), collating each position number of a particular special two- 
character chain Tc1 with a particular position number of a particular first two-character 
chain Tc2 to judge whether or not each position number of the particular special two- 
character chain Tc1 is higher than the particular position number of the particular first 
two-character chain Tc2 by one (third collation case), collating each position number of 
a particular special two-character chain Tc1 with a particular position number of a 
particular second two-character chain Tc2 to judge whether or not each position number 
of the particular special two-character chain Tc1 is higher than the particular position 
number of the particular second two-character chain Tc2 by two (fourth collation case), 
collating each position number of a particular first two-character chain Tc1 with a 
particular position number of a particular special two-character chain Tc2 to judge 
whether or not each position number of the particular first two-character chain Tc1 is 
higher than the particular position number of the particular special two-character chain 
Tc2 by one (fifth collation case), and detecting a particular position number of a 
particular two-character chain of the particular two-character chain table Tc1 for each 
collation case (Katayama Col. 131 lines 27-67). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle to incorporate compression being a 
grouping of two or more symbols treated as a single unit for purposes of linguistic 
sorting and the compression type identifying a number of symbols in a given 
compression as taught by Katayama to allow for control of several letters/symbols within 
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a character chain, wherein a table is used to track, judge, and find the position and 
number of symbols present (Katayama Col. 130 lines 33-53). 

However, Lisle in view of Katayama fails to teach 

providing a plurality of compression tables, each compression table pertaining to 
one of the supported languages and having a compression type and containing 
compressions of symbols of that compression type 

linguistic sorting determined based on a compression type of the given 
compression, a first of the two or more symbols in the given compression and a 
predefined order of symbols 

Okada teaches discriminating the kind of language, wherein a separating unit 32 
per language is provided for the language string separating unit 12. On the basis of the 
discrimination result of the language by the row octet decoder 30, the separating unit 32 
per language separates the input Unicode data into each language string such as Latin 
(English), Greek, or the like. A compressing unit corresponding to each language 
allocated for the Unicode is individually provided for the language string compressing 
unit 14. In the embodiment, a Latin compressing unit 34, a Greek compressing unit 36, 
a Hangul compressing unit 38, a Kanji compressing unit 40, and the like are provided. 
As a compressing unit per language which is provided for the language string 
compressing unit 14, it is sufficient to properly decide the compressing unit in 
accordance with the language which is treated in the Unicode source data as a 
compression target. The compression data per language compressed by each of the 
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Latin compressing unit 34, Greel^ compressing unit 36, Hangul compressing unit 38, 
and Kanji compressing unit 40 is unified by a code unifying unit 42 and the unified data 
is outputted as compression data. As a compressing method of each compressing unit 
per language provided for the language string compressing unit 14, a plurality of 
dictionary memories corresponding to the languages are provided and there is executed 
a Ziv-Lempel encoding for encoding by a longest coincidence retrieval of the character 
string which is inputted per data of the language string and the character string which 
has already been registered in the dictionary for every language. In the Ziv-Lempel 
encoding, any one of the dynamic dictionary method and the slide dictionary method 
can be used. As another compressing method, for the character string separated every 
language, on the basis of a probability table per language string obtained until now, the 
character string which is inputted every data can be also multi-value arithmetic 
encoded. The source data of the Unicode in which different languages mixedly exist is 
separated every language and is individually compressed, so that the compression of 
each character string in which statistic natures are similar is executed. A compressing 
unction in the Ziv-Lempel encoding, arithmetic encoding, or the like is effectively used 
and a high compression ratio can be realized (Okada Col. 12 line 26 -Col. 13 line 8 & 
Fig. 17 and 18 compression, decompression). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle in view of Katayama to incorporate 
providing a plurality of compression tables, each compression table pertaining to one of 
the supported languages and having a compression type and containing compressions 
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of symbols of that compression type and linguistic sorting determined based on a 
compression type of the given compression, a first of the two or more symbols in the 
given compression and a predefined order of symbols as taught by Okada to allow for 
the distinguishing between multiple languages mixed or separately in a Unicode 
environment dealing with compression, decompression, and sorting to create a high 
compression ratio/type depending on the language (i.e. grammar), wherein 
compression ratios vary for each language in reference to a already existing language 
(Okada Col. 12 line 26 -Col. 13 line 8 & Fig. 17 and 18 compression, decompression). 

Re claims 1 1 and 16, Lisle teaches a computer-readable medium having 
computer-executable instructions for performing steps for a computer search program to 
carry out a linguistic sorting operation (Col. 15 lines 45-63, comprising: 

receiving an input string containing a plurality linguistic symbols (Col. 6 lines 42- 
58) used in a given language (Col. 15 lines 45-63); 

for a first symbol in a combination of symbols in the input string (Col. 15 lines 45- 
63), referencing a symbol table (Col. 20 lines 35-56) to obtain a highest compression 
type for compressions beginning with said first symbol, the symbol table having a list of 
code points each uniquely identifying a symbol and a sort weight for the symbol 
identified by said each code point; 

performing a binary search (Col. 16 lines 6-27) through each of a plurality of 
compression tables (Col. 19 lines 36-59) containing compressions for the given 
language to find a matching compression that matches said combination of symbols in 
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the input string (Col. 16 lines 6-27), wherein the plurality of compression tables are 
searched in a descending order (Col. 15 lines 45-63) of compression types of the 
compression tables (Col. 19 lines 36-59) starting with a compression table having a 
compression type equal to said highest compression type for said first symbol (Col. 15 

lines 45-63). 

NOTE: Tagging a code point is construed to be both functionally equivalent and 
equally effective as ranking or ordering a code point or address in memory for the 
purposes of a hierarchical classification. 

However, Lisle fails to teach each compression being a grouping of two or more 
symbols treated as a single unit for purposes of linguistic sorting and the compression 
type identifying a number of symbols in a given compression 

Katayama teaches a registration two-character chain table producing unit 194 for 
producing a first table block, in which a plurality of registration first and second two- 
character chains respectively including the same type of for general character and the 
position numbers of the registration first and second two-character chains are listed in 
the order of arranging the chains in the converted registration character string, for each 
fore general character type, producing a second table block, in which a plurality of 
registration special two-character chains respectively including the same type of fore 
symbolic character and the position numbers of the registration special two-character 
chains are listed in the order of arranging the chains in the converted registration 
character string, for each fore symbolic character type, and combining each first table 
block corresponding to one type of fore general character and one second table block 
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corresponding to one type of fore symbolic character determined In correspondence to 
the type of the fore general character to form a two-character chain table for each 
character group, the fore characters of the chains In each two-character chain table 
belonging to the same character group (Katayama Col. 130 lines 33-53). 

Further, Katayama teaches a character chain collating and judging unit 200 for 
receiving the position numbers of one particular two-character chain Tel from the 
storing unit 1 95 just after the reception of the position numbers of another particular 
two-character chain Tc2 under the control of the control unit 199. (First collation case), 
collating each position number of a particular second two-character chain Tel with a 
particular position number of a particular first two-character chain Tc2 to judge whether 
or not each position number of the particular second two-character chain Tel agrees 
with the particular position number of the particular first two-character chain Tc2 
(second collation case), collating each position number of a particular special two- 
character chain Tc1 with a particular position number of a particular first two-character 
chain Tc2 to judge whether or not each position number of the particular special two- 
character chain Tel Is higher than the particular position number of the particular first 
two-character chain Tc2 by one (third collation case), collating each position number of 
a particular special two-character chain Tel with a particular position number of a 
particular second two-character chain Tc2 to judge whether or not each position number 
of the particular special two-character chain Tel Is higher than the particular position 
number of the particular second two-character chain Tc2 by two (fourth collation case), 
collating each position number of a particular first two-character chain Tel with a 
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particular position number of a particular special two-character chain Tc2 to judge 
whether or not each position number of the particular first two-character chain Tc1 is 
higher than the particular position number of the particular special two-character chain 
Tc2 by one (fifth collation case), and detecting a particular position number of a 
particular two-character chain of the particular two-character chain table Tc1 for each 
collation case (Katayama Col. 131 lines 27-67). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle to incorporate compression being a 
grouping of two or more symbols treated as a single unit for purposes of linguistic 
sorting and the compression type identifying a number of symbols in a given 
compression as taught by Katayama to allow for control of several letters/symbols within 
a character chain, wherein a table is used to track, judge, and find the position and 
number of symbols present (Katayama Col. 130 lines 33-53). 

However, Lisle in view of Katayama fails to teach a highest compression type for 
compressions beginning with said first symbol, the symbol table having a list of code 
points each uniquely identifying a symbol and a sort weight for the symbol identified by 
said each code point for a given language. 

Okada teaches discriminating the kind of language, wherein a separating unit 32 
per language is provided for the language string separating unit 12. On the basis of the 
discrimination result of the language by the row octet decoder 30, the separating unit 32 
per language separates the input Unicode data into each language string such as Latin 
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(English), Greek, or the like. A compressing unit corresponding to each language 
allocated for the Unicode is individually provided for the language string compressing 
unit 14. In the embodiment, a Latin compressing unit 34, a Greek compressing unit 36, 
a Hangul compressing unit 38, a Kanji compressing unit 40, and the like are provided. 
As a compressing unit per language which is provided for the language string 
compressing unit 14, it is sufficient to properly decide the compressing unit in 
accordance with the language which is treated in the Unicode source data as a 
compression target. The compression data per language compressed by each of the 
Latin compressing unit 34, Greek compressing unit 36, Hangul compressing unit 38, 
and Kanji compressing unit 40 is unified by a code unifying unit 42 and the unified data 
is outputted as compression data. As a compressing method of each compressing unit 
per language provided for the language string compressing unit 14, a plurality of 
dictionary memories corresponding to the languages are provided and there is executed 
a Ziv-Lempel encoding for encoding by a longest coincidence retrieval of the character 
string which is inputted per data of the language string and the character string which 
has already been registered in the dictionary for every language. In the Ziv-Lempel 
encoding, any one of the dynamic dictionary method and the slide dictionary method 
can be used. As another compressing method, for the character string separated every 
language, on the basis of a probability table per language string obtained until now, the 
character string which is inputted every data can be also multi-value arithmetic 
encoded. The source data of the Unicode in which different languages mixedly exist is 
separated every language and is individually compressed, so that the compression of 
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each character string in which statistic natures are similar is executed. A compressing 
unction in the Ziv-Lempel encoding, arithmetic encoding, or the like is effectively used 
and a high compression ratio can be realized (Okada Col. 12 line 26 -Col. 13 line 8 & 
Fig. 17 and 18 compression, decompression). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle in view of Katayama to incorporate a 
highest compression type for compressions beginning with said first symbol, the symbol 
table having a list of code points each uniquely identifying a symbol and a sort weight 
for the symbol identified by said each code point for a given language as taught by 
Okada to allow for the distinguishing between multiple languages mixed or separately in 
a Unicode environment dealing with compression, decompression, and sorting to create 
a high compression ratio/type depending on the language (i.e. grammar), wherein 
compression ratios vary for each language in reference to a already existing language 
(Okada Col. 12 line 26 -Col. 13 line 8 & Fig. 17 and 18 compression, decompression). 

Re claim 12, Lisle teaches a computer-readable medium as in claim 11, wherein 
the compressions in each of the compression tables (Col. 19 lines 36-59) are sorted 
according to code points for symbols forming the compressions (Col. 15 lines 45-63). 

Re claim 2, 7, and 15, Lisle in view of Katayama fails to teach the computer- 
readable medium as in claim 1 , wherein the code points are assigned to the symbols 
according to the Unicode standard. 
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Okada teaches discriminating the kind of language, wherein a separating unit 32 
per language is provided for the language string separating unit 12. On the basis of the 
discrimination result of the language by the row octet decoder 30, the separating unit 32 
per language separates the input Unicode data into each language string such as Latin 
(English), Greek, or the like. A compressing unit corresponding to each language 
allocated for the Unicode is individually provided for the language string compressing 
unit 14. In the embodiment, a Latin compressing unit 34, a Greek compressing unit 36, 
a Hangul compressing unit 38, a Kanji compressing unit 40, and the like are provided. 
As a compressing unit per language which is provided for the language string 
compressing unit 14, it is sufficient to properly decide the compressing unit in 
accordance with the language which is treated in the Unicode source data as a 
compression target. The compression data per language compressed by each of the 
Latin compressing unit 34, Greek compressing unit 36, Hangul compressing unit 38, 
and Kanji compressing unit 40 is unified by a code unifying unit 42 and the unified data 
is outputted as compression data. As a compressing method of each compressing unit 
per language provided for the language string compressing unit 14, a plurality of 
dictionary memories corresponding to the languages are provided and there is executed 
a Ziv-Lempel encoding for encoding by a longest coincidence retrieval of the character 
string which is inputted per data of the language string and the character string which 
has already been registered in the dictionary for every language. In the Ziv-Lempel 
encoding, any one of the dynamic dictionary method and the slide dictionary method 
can be used. As another compressing method, for the character string separated every 
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language, on the basis of a probability table per language string obtained until now, the 
character string which is inputted every data can be also multi-value arithmetic 
encoded. The source data of the Unicode in which different languages mixedly exist is 
separated every language and is individually compressed, so that the compression of 
each character string in which statistic natures are similar is executed. A compressing 
unction in the Ziv-Lempel encoding, arithmetic encoding, or the like is effectively used 
and a high compression ratio can be realized (Okada Col. 12 line 26 -Col. 13 line 8 & 
Fig. 17 and 18 compression, decompression). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle in view of Katayama to incorporate a 
Unicode standard for assigning code points to symbols as taught by Okada to allow for 
the distinguishing between multiple languages mixed or separately in a Unicode 
environment dealing with compression, decompression, and sorting to create a high 
compression ratio/type depending on the language (i.e. grammar), wherein 
compression ratios vary for each language in reference to a already existing language 
(Okada Col. 12 line 26 -Col. 13 line 8 & Fig. 17 and 18 compression, decompression). 

Re claim 1 7, Lisle teaches the computer-readable medium as in claim 1 1 , having 
further computer-executable instructions for storing a sort weight (Col. 15 lines 45-63) 
for the matching compression (Col. 16 lines 6-27). 
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Re claims 5 and 10, Lisle teaches the computer-readable medium as in claim 1 , 
further comprising computer-executable instructions for performing steps of sorting 
compressions (Col. 15 lines 45-63) in each of the compression tables based on 
combinations of code points (Col. 20 lines 35-56) of the compressions in said each 
compression table (Col. 19 lines 36-59). 

4. Claims 3, 4, 8, 9, 13, and 14, are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Lisle et al US 4,843,389 (hereinafter Lisle) in view of Katayama 
et al. US 5550541 A (hereinafter Katayama) and Okada US 5889481 A (hereinafter 
Okada) and further in view of Edberg 5,873,111 A (hereinafter Edberg). 

Re claims 3, 8, and 14, Lisle teaches sort weight of the symbol (Col. 15 lines 45- 
63) identified by said each code point (Col. 20 lines 35-56). 

However Lisle in view of Katayama and Okada fails to teach the computer- 
readable medium as in claim 1 , wherein the tag for each code point is stored as a 
portion 

Edberg teaches character attributes that may be organized in a particular 
collation order such that information located earlier in the list indicate a higher priority 
level of significance. For example, if "number" comes before "letter" in the order of the 
character attributes in class 40, then any number will be collated before any letter, such 
that "10" will be listed before "apple" in a list of information which has been collated by 
the sample ordering of category 32a. Alternatively, the character attributes 46 may be 
tagged with a prefix 43. The lower the prefix 43 of a character attribute 46, the earlier it 
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places in the collation order. For example, in the Unicode category 32c, Latin letters 
would list before Cyrillic letters in a collation order (Edberg Col. 12 lines 7-12). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify the system of Lisle in view of Katayama and Okada to 
incorporate the tag for each code point stored as a portion as taught by Edberg to allow 
for proper ordering and collation of characters, wherein prefixes are considered in a 
language specific text (i.e. Unicode and/or Latin) (Edberg Col. 12 lines 7-12). 

Re claims 4 and 9, Lisle teaches computer-readable medium as in claim 3, 
wherein the sort weight of the symbol identified by said each code point (Col. 20 lines 
35-56) comprises a case weight value (Col. 15 lines 45-63), and wherein the tag for said 
each code point is stored as part of the case weight value for said each code point (Col. 
20 lines 35-56). 

Re claims 13, Lisle teaches computer-readable medium as in claim 12, wherein 
each code point in the symbol table includes a tag indicating a highest compression 
type (Col. 19 lines 36-59) for said each code point (Col. 20 lines 35-56), and wherein 
said step of referencing retrieves the tag for the code point identifying said first symbol 
(Col. 15 lines 45-63). 
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Conclusion 

Any inquiry concerning this communication or earlier communications from tine 
examiner should be directed to Michael C. Colucci whose telephone number is (571)- 
270-1847. The examiner can normally be reached on 9:30 am - 6:00 pm, Monday- 
Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571 )-272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Michael C Colucci/ 
Examiner, Art Unit 2626 
Patent Examiner 
AU 2626 
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/Richemond Dorvil/ 

Supervisory Patent Examiner, Art Unit 2626 



